Abstract

An identity by Chaundy and Bullard writes $1/(1-x)^n$ ($n = 1, 2, \ldots$) as a sum of two truncated
binomial series. In a paper which appeared in 2008 in Indag. Math., the authors surveyed many
aspects of this identity. In the present paper we discuss much earlier occurrences of this identity
in works by Hering (1868), de Moivre (1738) and de Montmort (1713). A relationship with
Krawtchouk polynomials in work by Greville (1966) is also discussed.

1 Introduction 

In our paper [13] we surveyed the history of the often rediscovered formula

$$1 = (1-x)^{n+1}\sum_{k=0}^{m}\binom{n+k}{k}x^k + x^{m+1}\sum_{k=0}^{n}\binom{m+k}{k}(1-x)^k. \tag{1}$$

We attributed the formula to Chaundy & Bullard [2, p.256] (1960). However, we later learnt that some
giant steps back in time can be made to much earlier occurrences of this formula. Almost one century
before Chaundy & Bullard the formula was given by Hering [10] (1868). Then, with a jump of more
than one century, the formula was found in the work of de Moivre [14] (1738). Even 25 years earlier
the formula was given in implicit form already by de Montmort [16] (1713).
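As a quick sanity check of the formula under discussion, the following minimal sketch evaluates it in exact rational arithmetic. It assumes the form of the identity usually attributed to Chaundy and Bullard, as written in the docstring; the function name `chaundy_bullard_rhs` is ours and purely illustrative.

```python
from fractions import Fraction
from math import comb

def chaundy_bullard_rhs(x, n, m):
    """Evaluate
        (1-x)^(n+1) * sum_{k=0}^{m} C(n+k, k) x^k
      + x^(m+1)     * sum_{k=0}^{n} C(m+k, k) (1-x)^k,
    which the identity asserts to be 1 for every x."""
    first = (1 - x) ** (n + 1) * sum(comb(n + k, k) * x**k for k in range(m + 1))
    second = x ** (m + 1) * sum(comb(m + k, k) * (1 - x) ** k for k in range(n + 1))
    return first + second

# Exact rationals, so equality with 1 is tested without rounding error.
for n in range(4):
    for m in range(4):
        for x in (Fraction(1, 3), Fraction(-2, 5), Fraction(7, 4)):
            assert chaundy_bullard_rhs(x, n, m) == 1
```

Since both sides are polynomials in $x$, agreement at sufficiently many points for fixed $n$, $m$ already confirms those cases; it does not replace a proof for general $n$, $m$.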

The paper successively discusses these three early occurrences of the formula. Next, a correspondence
between Samuel Pepys and Isaac Newton, having some relation with identity (1), is briefly
discussed. We conclude with a much more recent connection with Krawtchouk polynomials which is
implicit in Greville [7] (1966).

2 Hering (1868)



In 1868 Hering [10, p.14, formula 1)] derived:
Here $(1-x)^{-m}$ is the power series in $x$ of $(1-x)^{-m}$ cut after the $n$-th term. Similarly, $(1-(1-x))^{-n}$ is
the power series in $1-x$ of $(1-(1-x))^{-n}$ cut after the $m$-th term. Thus Hering already had (1).

Hering's proof is different from any of the proofs given in [13]. For generic non-integer $m$ he writes
for the left-hand side of (2):
(3)

where inversion of the order of summation is used in the second equality and Pfaff's transformation
formula in the third equality. If $m$ tends to a positive integer, the last ${}_2F_1$ becomes
(Here, although not emphasized by Hering, we should require for convergence that $|x-1| > 1$. This
can later be relaxed in (2) by analytic continuation.) After multiplication of (4) by [ m ^"^ 2 )x n /{x - 1)
we can rewrite the first term (by inversion of the order of summation) as
and the second term as
Thus by substitution in (3) Hering settled (2).

Formula (2) is just one of many formulas derived in [10]. Hering does not specially emphasize this
particular result.



3 de Moivre (1738)

A much earlier reference was kindly communicated to us by Pieter de Jong and also mentioned in his
manuscript [12]. In 1738 A. de Moivre [14, p.196] (see also the 1756 edition [15, p.224]) wrote:

But as there is a particular elegancy for the Sums of a finite number of Terms in those
Series whose Coefficients are figurate numbers beginning at Unity, I shall Set down the
Canon for those Sums.
Let n denote the number of Terms whose Sum is to be found, and p the rank or order
which those figurate numbers obtain, then the Sum will be
which is to be continued till the number of Terms be = p.

[In the numerator of the last term above a factor $x^n$ is missing. This may have been a
printer's error.]

According to de Moivre [14, Corollary at end of p.195] the figurate numbers of order $p$ are the
successive coefficients in the power series of $1/(1-x)^p$. Thus, since he begins at unity, these are the
binomial coefficients $\binom{p+k-1}{k}$ ($k = 0, 1, 2, \ldots$). This is slightly different from the modern
definition given by Dickson (p.7), who defines the $k$-th figurate number of order $p$ as the binomial
coefficient $\binom{p+k-1}{p}$. The difference is that de Moivre starts counting orders at 1 (for instance,
triangular numbers have order 3 for him), while Dickson starts counting them at 0, by which
triangular numbers have order 2.
Thus by the above quotation de Moivre gives the identity

$$\sum_{k=0}^{n-1}\binom{p+k-1}{k}x^k = \frac{1 - x^n\sum_{k=0}^{p-1}\binom{n+k-1}{k}(1-x)^k}{(1-x)^p}.$$

Indeed, this shows that de Moivre had already (1) in 1738.

The section "Of the Summation of recurring Series" starting in de Moivre [14, p.193] gives some
indication of how he obtained his result. We will summarize this in modern terminology, and we state
everything at once for general $p$ instead of stating it for $p = 1, 2, 3$, etc.
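De Moivre's canon for the sum of the first $n$ terms of the series with figurate coefficients can be checked numerically. The sketch below assumes the closed form that follows from identity (1) after the substitution $n \to p-1$, $m \to n-1$ (stated in the docstring); it is a modern restatement and not de Moivre's own layout of the terms, and the function names are illustrative.

```python
from fractions import Fraction
from math import comb

def figurate_partial_sum(x, n, p):
    """Sum of the first n terms of the power series of 1/(1-x)^p."""
    return sum(comb(p - 1 + k, k) * x**k for k in range(n))

def closed_form(x, n, p):
    """(1 - x^n * sum_{j=0}^{p-1} C(n-1+j, j) (1-x)^j) / (1-x)^p, for x != 1."""
    tail = x**n * sum(comb(n - 1 + j, j) * (1 - x) ** j for j in range(p))
    return (1 - tail) / (1 - x) ** p

# Compare both expressions exactly for small orders p and lengths n.
for p in range(1, 5):
    for n in range(1, 6):
        for x in (Fraction(1, 2), Fraction(-1, 3), Fraction(5, 2)):
            assert figurate_partial_sum(x, n, p) == closed_form(x, n, p)
```

For $p = 3$ (triangular numbers in de Moivre's counting), $n = 4$ and $x = 1/2$ both functions return $21/4$, that is $1 + 3/2 + 6/4 + 10/8$.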

First de Moivre discusses infinite power series $S = \sum_{k=0}^{\infty} c_k x^k$ in which the coefficients satisfy a
recurrence relation $c_k = a_1 c_{k-1} + a_2 c_{k-2} + \cdots + a_p c_{k-p}$ with coefficients $a_j$ independent of $k$. Then he
observes that $S = q(x)/(1 - a_1 x - \cdots - a_p x^p)$, where $q(x)$ is a polynomial of degree at most $p-1$ which
can be explicitly computed. Next he observes that the figurate numbers $c_k = \binom{p+k-1}{k}$,
being polynomials of degree $p-1$ in $k$ which vanish for $k = -1, -2, \ldots, -p+1$, are annihilated by the
$p$-th finite difference:

$$\sum_{j=0}^{p}(-1)^j\binom{p}{j}c_{k-j} = 0 \qquad (k = 1, 2, \ldots).$$

From this he derives that $S = (1-x)^{-p}$ in this case. Finally he applies the same method to terminating
power series $S_n = \sum_{k=0}^{n-1} c_k x^k$. A recurrence relation $c_k = a_1 c_{k-1} + a_2 c_{k-2} + \cdots + a_p c_{k-p}$ will then yield
$S_n = (q(x) + x^n r(x))/(1 - a_1 x - \cdots - a_p x^p)$ for certain polynomials $q(x)$ and $r(x)$ of degree at most
$p-1$. For $c_k$ being the figurate numbers we get $S_n = (1 + x^n r(x))/(1-x)^p$, from which $r(x)$ can be
computed, in principle. However, de Moivre does not give an argument for how he arrives at the nice
explicit expression for $r(x)$ expanded in powers of $1-x$. Probably, he found the expression for low
values of $p$ and then extrapolated.
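The recurrence step in this argument is easy to test. The sketch below checks that the alternating sum $\sum_{j=0}^{p}(-1)^j\binom{p}{j}c_{k-j}$ vanishes for $k \ge 1$ when $c_k$ are the coefficients of $1/(1-x)^p$, with $c_k$ taken as $0$ for negative $k$. The value at $k = 0$ is $1$, which corresponds to the numerator $q(x) = 1$. Function names are illustrative assumptions.

```python
from math import comb

def figurate(k, p):
    """Coefficient of x^k in 1/(1-x)^p, taken as 0 for negative k."""
    return comb(p - 1 + k, k) if k >= 0 else 0

def pth_difference(k, p):
    """Alternating sum  sum_{j=0}^{p} (-1)^j C(p, j) c_{k-j}  with c_k = figurate(k, p)."""
    return sum((-1) ** j * comb(p, j) * figurate(k - j, p) for j in range(p + 1))

for p in range(1, 7):
    assert pth_difference(0, p) == 1          # constant term of (1-x)^p * S
    assert all(pth_difference(k, p) == 0 for k in range(1, 25))
```

This is just the coefficientwise statement that $(1-x)^p \cdot (1-x)^{-p} = 1$.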

De Moivre also made an important step by which he might have concluded the multi-variable
generalization of (1) given in [13, (1.5)] and first obtained (as far as we know) by Damjanovic, Klamkin
and Ruehr in 1986. In fact, de Moivre [Problem LXIX, p.191], [pp.50,51] gave for any of the
$n$ summands in the outer sum in [13, (1.5)] a probabilistic interpretation coming from the problem of
points with $n$ players (see the case of 2 players below). Adding up these chances to 1 would have given
him the multi-variable formula, just as was pointed out in [19] (in detail in a situation with three urns
by Bosch and Steutel). We are puzzled why de Moivre missed this final step, and also why he did not
give a probabilistic interpretation of (1).



4 de Montmort (1713) 

In 1713 appeared the second edition of the Essay d'analyse sur les jeux de hazard [16] by Pierre Raymond
de Montmort. It contained, among others, a new solution of the so-called problem of points for
two players. This problem comes from a game of chance with two players, Pierre and Paul, who have
chances $p$ and $1-p$, respectively, of winning each round. The player who has first won a certain number
of rounds (this number may be different for Pierre and Paul) will collect the entire prize. Suppose
that the game is prematurely interrupted when Pierre still has to win $n$ rounds and Paul $m$ rounds.
What is then a fair division of the stake? See Hald [9, §14.1] for a description of how this problem was
handled by de Montmort.

In the case of equal chances the problem was already solved by Pascal and Fermat in 1654. In
the case of unequal chances Johann Bernoulli generalized their solution. Bernoulli gives his solution
in a letter to de Montmort dated 17 March 1710. This letter is included in the second edition of de
Montmort's book, see [16, pp.283-298], in particular p.295 (English translation available at [17]). De
Montmort also gives this solution in his main text, see [16, pp.244-245, §190] (English translation
at [17]). Curiously, Bernoulli is not mentioned there by de Montmort. Nor does he acknowledge this
new result of Bernoulli in his polemical discussion of earlier work on the problem of points in the
Avertissement of the second edition of his book. This discussion starts on p. xxxiv of [16] (English
translation available at [17]). Bellhouse [1] gives an interesting discussion of the relationship between
de Montmort and de Moivre.

Bernoulli's solution is as follows. Imagine Pierre and Paul still play $m+n-1$ rounds. Then there
will certainly be a winner. Pierre will be the winner if he has won $n$ or more of these rounds. His
chance for this is

$$\sum_{k=n}^{m+n-1}\binom{m+n-1}{k}p^k(1-p)^{m+n-1-k}. \tag{5}$$

But if Pierre has won $n-1$ or less of these rounds then Paul will be the winner. The chance for this is

$$\sum_{k=0}^{n-1}\binom{m+n-1}{k}p^k(1-p)^{m+n-1-k}. \tag{6}$$

The two chances add up to 1, both by the probabilistic interpretation and by the binomial formula.

De Montmort [16, p.245, §191] continues to give other expressions for the chances for Pierre and
Paul, respectively, to win. Imagine they still play until there is a winner. Pierre will be the winner if he has
won already $n-1$ rounds and Paul at most $m-1$ rounds, and if then the next round is won by Pierre.
Thus the chance for Pierre to win is

$$p^n\sum_{k=0}^{m-1}\binom{n-1+k}{k}(1-p)^k. \tag{7}$$

Similarly for Paul the chance to win is

$$(1-p)^m\sum_{k=0}^{n-1}\binom{m-1+k}{k}p^k. \tag{8}$$

These two chances necessarily add up to 1. Thus the resulting formula (not given by de Montmort) is

$$p^n\sum_{k=0}^{m-1}\binom{n-1+k}{k}(1-p)^k + (1-p)^m\sum_{k=0}^{n-1}\binom{m-1+k}{k}p^k = 1,$$

by which formula (1) is proved in a probabilistic way. This is essentially the same proof as was quoted
from much more recent literature in [13, end of Section 6].
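The "play until there is a winner" argument can be sketched in a few lines. The code assumes the reading given in the text: a player with round-winning chance $p$ who still needs `need` rounds wins by taking the deciding round after exactly `need - 1` own wins and at most `other_need - 1` wins of the opponent. The helper name `wins_first` is hypothetical.

```python
from fractions import Fraction
from math import comb

def wins_first(p, need, other_need):
    """p^need * sum_{k=0}^{other_need-1} C(need-1+k, k) (1-p)^k.

    k counts the rounds won by the opponent before the deciding round;
    C(need-1+k, k) counts the orders in which those earlier rounds can fall."""
    q = 1 - p
    return p**need * sum(comb(need - 1 + k, k) * q**k for k in range(other_need))

p = Fraction(2, 5)      # Pierre's chance per round (arbitrary test value)
n, m = 5, 3             # rounds still needed by Pierre and by Paul
pierre = wins_first(p, n, m)
paul = wins_first(1 - p, m, n)
assert pierre + paul == 1   # exactly one of the two must win
```

With exact fractions the two chances add up to 1 on the nose, which is the probabilistic content of identity (1).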

Clearly the chances (5) and (7) are the same. This is not explicitly observed by de Montmort, but
it is indicated in the example where $n = 5$ and $m = 3$. In the general case the resulting identity is [13,
(2.7)]. There we referred to Guenther [8], who gave various proofs and references (but none older than
1933) for this identity, including the probabilistic proof we just observed.
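The equality of the two expressions for Pierre's chance can be confirmed for the example $n = 5$, $m = 3$. The sketch assumes Bernoulli's expression is the binomial tail "at least $n$ wins in $m+n-1$ rounds" and de Montmort's is the "deciding round" sum described in the text; both function names are ours.

```python
from fractions import Fraction
from math import comb

def bernoulli_pierre(p, n, m):
    """Chance of at least n wins for Pierre in m + n - 1 imagined rounds."""
    total = m + n - 1
    return sum(comb(total, k) * p**k * (1 - p) ** (total - k)
               for k in range(n, total + 1))

def montmort_pierre(p, n, m):
    """Chance that Pierre wins his n-th round while Paul has at most m - 1 wins."""
    return p**n * sum(comb(n - 1 + k, k) * (1 - p) ** k for k in range(m))

n, m = 5, 3
for p in (Fraction(1, 2), Fraction(2, 5), Fraction(7, 9)):
    assert bernoulli_pierre(p, n, m) == montmort_pierre(p, n, m)
```

For equal chances ($p = 1/2$) both expressions give $29/128$.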



5 Pepys and Newton (1693)

In 1693 Samuel Pepys wrote a letter to Isaac Newton with a question about a probabilistic problem
coming from a question to Pepys by John Smith (see [2], [18]). The question was (in modern terms):

Let $6k$ fair dice be tossed independently and consider the event that at least $k$ "6"s appear. For
which $k = 1, 2, 3$ does this event have the greatest chance of happening?

Newton wrote back three times. He answered correctly that the case $k = 1$ has the highest probability.
He actually computed the probability for $k = 1$ and 2. He also gave a theoretical argument, about
which Stigler [18], as late as 2006, observed that it was incorrect. Chaundy & Bullard [2] showed more
generally:

We work with fair dice with $s$ faces. Let $g(sn, n)$ be the chance that a selected face turns
up less than $n$ times in $sn$ throws. Then $g(sn, n)$ increases with $n$ for fixed $s$.

They proved this statement by expressing $g(sn, n)$ in terms of (5) and then using that (5) is equal to
(7). (In passing, in connection with their proof of this identity, they observed the identity (1).) Finally,
by working with (7) they could prove their claim quoted above.
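The numbers behind the Pepys question are quick to reproduce. The sketch below computes the binomial chances exactly; `at_least` and `g` are illustrative names, with `g(faces, n)` standing for the quantity $g(sn, n)$ with $s$ = `faces`. The assertions only test the cases $s = 6$, $n = 1, 2, 3$, not the general monotonicity statement.

```python
from fractions import Fraction
from math import comb

def at_least(hits, throws, faces=6):
    """Chance that a selected face shows at least `hits` times in `throws` fair throws."""
    p = Fraction(1, faces)
    return sum(comb(throws, j) * p**j * (1 - p) ** (throws - j)
               for j in range(hits, throws + 1))

def g(faces, n):
    """Chance that a selected face shows fewer than n times in faces * n throws."""
    return 1 - at_least(n, faces * n, faces)

# Pepys' three cases: at least k sixes among 6k dice, k = 1, 2, 3.
pepys = [at_least(k, 6 * k) for k in (1, 2, 3)]
assert pepys[0] > pepys[1] > pepys[2]        # k = 1 is the most likely case
assert g(6, 1) < g(6, 2) < g(6, 3)           # same fact, phrased via g

for k, chance in zip((1, 2, 3), pepys):
    print(k, float(chance))
```

The case $k = 1$ reduces to $1 - (5/6)^6$, which is the value Newton could compute directly.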
6 Greville (1966)

This last item does not push the history of identity (1) further back, but mentions an unexpected
aspect of this identity which is offered by Greville [7, p.166]. A further description is given in [6, §4,6].
Greville considers the smoothing filter $f \mapsto g$ given by

$$g(y) = \sum_{x=-N}^{N} f(y-x)\,K_{2n}(x,0)\,w(x) \qquad (y \in \mathbb{Z}),$$

where $w(x) := \binom{2N}{N+x}$ and $K_n$ is the Christoffel-Darboux kernel for the orthogonal polynomials $p_n$
satisfying

$$\sum_{x=-N}^{N} p_n(x)\,p_m(x)\,w(x) = h_n\,\delta_{n,m} \qquad (n, m \in \{0, 1, \ldots, 2N\}).$$

Then the polynomials $p_n$ are special shifted Krawtchouk polynomials

$$p_n(x) = K_n(x+N; \tfrac12; 2N),$$

but this is not explicitly mentioned by Greville. Then we also see that $h_n = 2^{2N}\binom{2N}{n}^{-1}$ and that

$$K_{2n}(x,0) = \sum_{k=0}^{2n}\frac{p_k(x)\,p_k(0)}{h_k}.$$

Greville wants to compute the characteristic function (or transfer function) $\phi$ associated with this
smoothing filter, given by

$$\phi(\omega) := \sum_{x=-N}^{N} K_{2n}(x,0)\,w(x)\,e^{-i\omega x}.$$

Then he derives that

$$\phi(\omega) = 1 - (\sin^2(\omega/2))^{n+1}\,P(\sin^2(\omega/2)) = (\cos^2(\omega/2))^{N-n}\,Q(\sin^2(\omega/2)) \tag{9}$$

for certain polynomials $P$ of degree $N-n-1$ and $Q$ of degree $n$. Then, with the same argument as
in [13, Section 6.1] and [13, Remark 2.2], Greville explicitly obtains $P$ and $Q$. As a consequence, (9)
takes the form of (1) with $m = N-n-1$. Greville also concludes from the explicit expression that $\phi$ is
monotonically decreasing from 1 to 0 on $[0, \pi]$. Later Herrmann [11] independently computed (9) in
a different way in order to arrive at this result of monotonic decrease of $\phi$, which he called maximal
flatness.
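The monotonic decrease can be illustrated numerically. The sketch below is not Greville's computation: it assumes that the polynomial $Q$ is the truncated binomial series $Q(y) = \sum_{k=0}^{n}\binom{N-n-1+k}{k}y^k$, as the Chaundy-Bullard identity with $m = N-n-1$ suggests, and requires $n < N$. The function name `phi` and the sample values $N = 5$, $n = 2$ are arbitrary choices.

```python
import math

def phi(omega, N, n):
    """(cos^2(omega/2))^(N-n) * Q(sin^2(omega/2)) with the assumed
    Q(y) = sum_{k=0}^{n} C(N-n-1+k, k) y^k, for 0 <= n < N."""
    y = math.sin(omega / 2) ** 2
    m = N - n - 1
    q = sum(math.comb(m + k, k) * y**k for k in range(n + 1))
    return (1 - y) ** (m + 1) * q

N, n = 5, 2
grid = [math.pi * i / 200 for i in range(201)]
values = [phi(w, N, n) for w in grid]

assert values[0] == 1.0                      # phi(0) = 1
assert abs(values[-1]) < 1e-12               # phi(pi) = 0
# Non-increasing along the grid, up to floating-point noise.
assert all(b <= a + 1e-12 for a, b in zip(values, values[1:]))
```

For $N = 1$, $n = 0$ the function reduces to $\cos^2(\omega/2)$, which is the transfer function of the three-point filter with weights $1/4, 1/2, 1/4$.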